Quantization Fundamentals with Hugging Face
Static Quantization with Hugging Face `optimum` for ~3x latency ...
Quantization with Hugging Face Optimum
Hugging Face Quantization | Accelerated inference on NVIDIA GPUs – XHYY
The Story of Hugging Face Model Quantization
Quantization Fundamentals with Hugging Face | Chris Holland
FLUX.2 [dev] Quantization - a Hugging Face Space by multimodalart
Quantization · Hugging Face
Linear quantization with Hugging Face Quanto🤗 | by kirouane Ayoub | GoPenAI
Efficient Multi-Model Inference with 4-bit Quantization in Hugging Face ...
Model Quantization with 🤗 Hugging Face Transformers and Bitsandbytes ...
New course on linear quantization with Hugging Face | DeepLearning.AI ...
🤖 Hugging Face has updated their quantization docs in Transformers ...
Embedding Quantization - a Hugging Face Space by SwastikM
LLM Quantization Advanced - a Hugging Face Space by openfree
LLM Quantization - a Hugging Face Space by bhaskartripathi
Quantization - a Hugging Face Space by rakesh9177
Quantization - a Hugging Face Space by PEFT
How to Run DeepSeek Locally: Using Hugging Face and Quantization for ...
Online Course: Quantization Fundamentals with Hugging Face from ...
Quantizing Models from Hugging Face Using BitsnBytes | Quantization ...
Large-Model Quantization Basics Taught by the HuggingFace Team: Quantization Fundamentals with Hugging Face ...
Quantization Fundamentals with Hugging Face | Pandurang Nayak
Quantization GPTQ - 🤗Optimum - Hugging Face Forums
kernels-community/quantization-gptq · Hugging Face
Large-Model Quantization Basics Taught by the HuggingFace Team: Quantization Fundamentals with Hugging Face - CSDN Blog
Which .GGUF Should You Download? (Hugging Face Quantization Guide ...
@macadeliccc on Hugging Face: "Benefits of `imatrix` quantization in ...
New course with Hugging Face: Quantization in Depth 🤗 : r/PostAI
Kameshr/LLAMA-3-Quantized · Hugging Face
KIST-robot-intelligence/Qwen-14B-Chat-GGUF-Quantization · Hugging Face
Getting Started with LLaMA 3 on Hugging Face: 4-Bit Quantization Made ...
Quantization Formats And Cuda Compute Capability Support - a Hugging ...
Quantize Hugging Face model to AWQ int4: A Step-by-Step Guide with ...
Deep Dive into Hugging Face Quanto: A Comprehensive Guide to ...
kevinbazira/aya-expanse-8b-gptq-4bit · Hugging Face
xaviviro/finetuned-mistral-7b-quantized-gguf · Hugging Face
Llama2 Quantized Deploy - a Hugging Face Space by Kunalpal216
Thank you DeepLearning.AI and Hugging Face for amazing course on ...
latestissue/rwkv-4-pileplus-ggml-quantized · Hugging Face
New course with Hugging Face: Quantization Fundamentals - YouTube
Model Cards - Hugging Face Docs
Quantized Models for Wan-AI/Wan2.1-T2V-14B – Hugging Face
Quantized Models for JackChew/Qwen2-VL-2B-OCR – Hugging Face
Quantized Models for deepseek-ai/DeepSeek-R1-0528-Qwen3-8B – Hugging Face
Quantized Models for Qwen/Qwen-Image – Hugging Face
Quantized Models for clouditera/secgpt – Hugging Face
Quantized Models for Qwen/Qwen2.5-32B – Hugging Face
Quantized Models for shisa-ai/shisa-v2-qwen2.5-7b – Hugging Face
Quantized Models for pfnet/plamo-2-8b – Hugging Face
Quantized Models for google/gemma-3-27b-it – Hugging Face
CRD716/ggml-vicuna-1.1-quantized · Hugging Face
Quantized Models for microsoft/Phi-3.5-mini-instruct – Hugging Face
akshathmangudi/llama3.1-8b-quantized · Hugging Face
Quantized Models for hexgrad/Kokoro-82M – Hugging Face
Quantized Models for openai-community/gpt2-xl – Hugging Face
Quantized Models for Qwen/Qwen3-30B-A3B-Instruct-2507 – Hugging Face
Quantized Models for Qwen/Qwen2.5-1.5B-Instruct – Hugging Face
Quantized Models for deepseek-ai/DeepSeek-V3 – Hugging Face
neuralmagic/Mistral-7B-Instruct-v0.3-quantized.w8a8 · Hugging Face
Quantized Models for openai/gpt-oss-20b – Hugging Face
Quantized Models for Qwen/Qwen2.5-Coder-14B-Instruct-GGUF – Hugging Face
frinedl/ggml-quantized-whisper-en · Hugging Face
Quantized Models for Qwen/Qwen2.5-Omni-7B – Hugging Face
Quantized Models for deepseek-ai/DeepSeek-R1-0528 – Hugging Face
Quantized Models for facebook/nllb-200-3.3B – Hugging Face
Quantized Models for Qwen/Qwen2.5-Coder-1.5B – Hugging Face
mohan11111/seallm7bint4symmetricquantization · Hugging Face
kiri-ai/gpt2-large-quantized · Hugging Face
Quantized Models for facebook/musicgen-small – Hugging Face
FabbriSimo01/Facebook_opt_1.3b_Quantized · Hugging Face
Quantized Models for Qwen/Qwen2.5-0.5B-Instruct – Hugging Face
Quantized Models for microsoft/Florence-2-large – Hugging Face
Quantized Models for KBLab/kb-whisper-large – Hugging Face
Quantized Models for Qwen/Qwen2.5-VL-3B-Instruct – Hugging Face
Quantized Models for Qwen/Qwen2-VL-7B-Instruct – Hugging Face
@Jaward on Hugging Face: "PyTorch implementation of the Self ...
huggingface/documentation-images · Embedding Quantization blogpost
@macadeliccc on Hugging Face: "Quantize 7B paramater models in 60 ...
PEFT and Quantization | huggingface/trl | DeepWiki
@ronantakizawa on Hugging Face: "Introducing AWQ and GPTQ quantized ...
A Guide to Supervised Fine-Tuning and 4-Bit Quantization for Language ...
@eaddario on Hugging Face: "Experimental global target bits‑per‑weight ...
@ybelkada on Hugging Face: "Check out quantized weights from ISTA-DAS ...
How to Use Hugging Face: A Comprehensive AI Guide
Quantization Explained: Why the Same LLM Gives Better Results on High ...
Introduction to Quantization cooked in 🤗 with 💗🧑‍🍳
Slow performance in Quantization · Issue #309 · huggingface/text ...
How to Use Hugging Face: Beginner’s Guide (2026)
@cbensimon on Hugging Face: "🚀 ZeroGPU now supports PyTorch native ...
@sayakpaul on Hugging Face: "It's been a while we shipped native ...
huggingface/documentation-images at main
huggingface/documentation-images at HEAD
blog/embedding-quantization.md at main · huggingface/blog · GitHub
Quantized Models for HuggingFaceTB/SmolVLM2-500M-Video-Instruct ...
hugging-quants (Hugging Quants)
GitHub - edcalderin/huggingface-ragflow: This project implements a ...
HuggingFace Paper Explorer
Blog – PyTorch
DrishtiSharma/llama-2-chat-gptq-block-quantization-even-layers ...